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SYSTEM FOR DOWNLOADING TELEVISION INFORMATION TO 
EXTERNAL DEVICES AND MEDIA 

FIELD OF THE INVENTION 

The present invention relates to interactive television and more 
particularly to interactive television services that enable downloading of television 
information to external devices and media. 

i 

BACKGROUND OF THE INVENTION 
Downloading via the Internet tunes for use as telephone rings for 
cellular telephones is well known in the art Also known in the ait is downloading 
images to cellular telephones via the Internet 

Some aspects of technologies and related art that may be useful in 
understanding the present invention are described in the following publications: 

US Patent 5,606,365 to Mauritius et al, which describes an interactive 
camera for network processing of captured images; 

Published US Patent application 2003/0167469 of Hardingham et al, 
which describes a method and system of offering for sale Images related to a 
television program via interactive television; and 

Published PCT patent application WO 99/59339, which describes an 
interactive television system for selectively retrieving video images or teletext pages 
from a remote transmitter; 

Published US Patent application 2003/0093786 of Amsellera, which 
describes an interactive television method and device; 

Published PCT patent application WO 03/055217, which describes an 
interactive television commerce system mat includes a video broadcast distribution 
network having multiple in-band and out-band channels; 

Published PCT patent application WO 02/082796, which describes a 
video image display system mat is used for interactive television, videoconferencing 
andtelemonltoring; and 



Published US Patent application 2003/0011636 of Feroglia et al, 
which describes a method for magnifying content 

The disclosures of all references mentioned above and throughout the 
specification, as well as die disclosures of all references mentioned in those 
S references* are hereby incorporated herein by reference. 
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SUMMARY OF THE INVENTION 
The present invention seeks to provide a system and method for 
enabling individual determination and selection of information, such as images and 
rings, to be downloaded from television, and for downloading the individually 

5 determined information to external devices and media. Today, individual choice of 
rings and images is typically not available because users cannot choose rings and 
images that were not previously determined and placed in databases by providers of 
the rings and images. Enabling individual determination and selection of rings and 
images to be downloaded makes it possible for users to have an individual "touch" 

10 with regards to usage of the external devices and media and is therefore considered 
desirable. 

The external devices and media may include, for example, mobile (or 
cellular) telephones and other personal computing, entertaining and communicating 
devices. By way of example, the description below refers to downloading rings and 

IS images from an entertainment television channel. The rings and the images are 
comprised in audio/video (A/V) sections. However, it is appreciated that the present 
invention is not limited to rings and images or to a specific channel or content 
provider, and a service for downloading rings and images from television (a 
"snapshot service**) may 4 fit* many broadcasters (assuming they have the right type 

20 of content). 

The present invention also seeks to enable quick and sumple use of the 
service because users are expected to act on impulse. 

The present invention further seeks to leave a target audience as much 
as possible in control of a final image and/or ring downloaded to their external 
25 devices. 

The terms "tune", "ring" "polyphonic ring", and "ring tone" in 
singular or plural are interchangeably used throughout the present specification and 
claims to refer to music or sounds that may be used as a telephone ring or as 
background sounds to an application. 
30 Preferably, the service is an Integral part of watching television, that 

is, a user may decide to download tunes and images while viewing, for example, the 



latest times presented in glamorous music videos on the entertainment television 
channel. 

Taking advantage of interactive capabilities of interactive television 
(ITV), viewers may be offered a high degree of freedom. Preferably, the viewers are 

5 offered toe opportunity to take a 'snapshot 1 of what they believe is the most "cool" 
background image for their mobile telephone whether it is an image of their favorite 
pop star, a cartoon image or any other preferred image, including, inter alia, a 
background environment such as a landscape view. When a viewer presses, for 
example, a key on a remote control, a flashy blitz effect preferably appears on a 

10 television screen and a few minutes later the viewer preferably receives the 
background image on his or her mobile phone* 

It is appreciated that allowing individual creation and flashing of an 
image may be attractive to many viewers that are also users of cellular telephones 
and may enable them to add individual "statements" and/or "touch" to ways in 

IS which the users use their cellular telephones. Such individual statements and touch 
cannot be achieved by letting the viewers select between a limited range of 
background images of which they have no control 

Similar freedom is preferably offered in defining polyphonic ring 
tones (rings). The viewers are preferably enabled to "record" tunes of their liking and 

20 preference, for example by pressing 'start' and 'stop 9 at any point during a music 
video* 

A snapshot service may also be offered to viewers of non-interactive 
television, such as viewers of conventional analog terrestrial television. Such a 
snapshot service is referred to throughout the present specification and claims as an 
25 "analog snapshot service". The analog snapshot service makes use of a telephone 
and does not require a set-top box (STB) as in a snapshot service that uses ITV. The 
analog snapshot service is therefore suitable for larger audiences and larger varieties 
of markets than a snapshot service that uses ITV* 

In the analog snapshot service, a viewer that watches television may 
30 press a telephone key at an instant when a desired image appears on television. A 
voice response system preferably registers the exact time the viewer pressed the key, 
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and based on such a registered time, a video frame including the desired image may 
be determined and transmitted to a mobile telephone of the viewer. 

There is thus provided in accordance with a preferred embodiment of 
the present invention a method for selecting an AN section from an AN program 
rendered on a television, the method including pointing at die AN section 
substantially when the AN section is rendered on the television, and identifying the 
AN section based on when the pointing was carried out with respect to rendering 
progress of the AN program on the television. 

Further in accordance with a preferred embodiment of the present 
invention there is provided apparatus for selecting an A/V section from an AN 
program rendered on a television, the apparatus including a pointer enabling 
pointing at the AN section substantially when the AN section is rendered on the 
television, and a processor identifying the AN section based on when pointing of 
the AN section was carried out with respect to tendering progress of the AN 
program on the television. 



BRIEF DESCRIPTION OF THE DRAWINGS 
The present invention will be understood and appreciated more ftilly 
from the following detailed description, taken in conjunction with the drawings in 
which; 

5 Figs. 1 - 3 are simplified pictorial illustrations depicting television 

screens displayed to a user during offering and using a snapshot service; 

Figs* 4 and 5 are simplified pictorial illustrations depicting display 
screens displayed to a user during editing of a downloaded background image; 

Fig. 6 is a simplified pictorial illustration depicting a television screen 
1 0 displayed to a user during editing of a polyphonic ring; 

Figs. 7A and 7B together constitute a simplified block diagram 
illustration of a system for providing a snapshot service for a case where television 
information from which snapshots are to be taken comprises television information 
in an MPEG digital format; 
IS Figs. 7C and 7D together constitute a simplified block diagram 

illustration of a system for providing a snapshot service for a case where television 
information from which snapshots are to be taken comprises analog television 
information; 

Fig, 8 is a simplified block diagram illustration of a system for 
20 providing a snapshot service in which there are various latencies; 

Fig. 9A is a simplified block diagram illustration of a frame recorder 
indicated "iCapture" in the system of Figs- 7A and 7B; and 

Fig. 9B is a simplified block diagram illustration of a frame recorder 
indicated "iCaptute" In the system of Figs. 7C and 7D. 
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DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT 
1 Concept 

A snapshot service enables a user to Individually determine and 
select information to be downloaded from a television, such as images and rings, 
S and to download the images and rings to an external device or external media The 
external device may include, for example, any one of the following: a mobile 
(cellular) telephone; a computing device; an entertainment device; and a 
communication device, The external media may include media in an external 
device or media attached to an external device, such as a memory stick that may be 
10 associated with an external device comprising, for example, a computing device 
such as a personal computer (PC). For simplicity, the description below refers to a 
mobile telephone as the external device to which the information is. downloaded 
from the television, but it is appreciated that the present invention is not limited to 
mobile telephones as external devices. 
IS It is appreciated that individual determination and selection of the 

information to be downloaded from the television allows the user to create an 
individual statement - both in terms of visuals and audio. Preferably, individual 
determination and selection of the information to be downloaded from the 
television is enabled by allowing the user to take . 'snapshots' of background 
20 images and polyphonic ring tones by simple key pressing, for example, on a 
remote control The background images and the polyphonic ring tones are 
comprised in audio/video (AAO sections and the user may select an A/V section 
from an A/V program rendered on the television by pointing at the A/V section 
substantially when the A/V section is rendered on the television* The A/V section 
25 is then identified based on when the pointing was carried out with respect to 
rendering progress of the A/V program on the television. 

The snapshots may be transmitted to mobile telephones* 
Alternatively, the snapshots may bo transmitted to media other than mobile 
telephones (as part of the very same transmission mechanism) and used 
30 respectively, for example, as wallpaper and start-up sounds for PCs. 
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In an analog snapshot service, users of non-interactive television, 
such as analog broadcast television, may use telephones to make individual 
determination and selection of information to be downloaded from television. A 
key press on a telephone keypad at a time a desired image is displayed is 
5 preferably recorded by a voice response system and used to determine a video 
frame that includes the desired image. The desired image may then be transmitted 
to mobile telephones or to media other than mobile telephones for use, for 
example, as wallpaper and start-up sounds. 

1,1 Background Images 

10 Referring, for example, to a snapshot service that uses ITV, an 

example of a screen on a television offering a user an opportunity to download a 
background image is desoribed below with reference to Fig. !. Being a fan of, for 
example, a singer, the user may want to 'snap* the picture of the singer as she 
performs live, for example on an entertainment television channel, for use as a 
IS background image on his/her mobile telephone. Such an opportunity is preferably 
indicated/promoted, for example, by displaying an icon and a remote control 
reference on the television screen as shown in Fig. I . 

Preferably, when the user presses a dedicated key on a remote 
control, e.g., the yellow key, the key press is translated to an instruction to 
20 instantly take a 'snap' of the singer picture. Some flashy visual blitz effect as 
illustrated in Fig. 2 may preferably indicate a picture being currently taken - only 
visible for a very short time equivalent to a couple of frames and preferably 
starting from ti\e picture frame being snapped. 

ft is appreciated that although frame accuracy cannot typically be 
25 guaranteed, it may come very close, Displaying the flashy effect generally 
resembles flash usage in a conventional film camera. Such an effect may be used 
to train the user to capture the exact 'moment* as is the case in conventional film 
cameras. Since many users have learned to operate film cameras in a satisfactory 
way, it is anticipated that taking a snapshot may also be learned to provide 
30 satisfactory results. 
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As soon as a picture has been taken (snapped), a message is 
preferably prompted on the television screen as shown in Fig. 3. The message 
preferably invites the user to enter a number of his/her mobile telephone where the 
background image is to be downloaded 

5 Preferably, a number of validation and confirmation pop-up screens 

are usedl>efore submitting, via a return path provided by a modem (not shown) of 
a set-top box (STB) (not shown) associated with the television, a request to 
download the snapped picture. The pop-up screens may preferably form part of a 
procedure for order confirmation and monetary transaction for payment for the 

10 service. 

After the user enters the number of his/her mobile telephone and 
confirms his/her request and payment, the request is preferably transmitted via the 
modem to a backend installation of the service, such as a server. Hie server 
preferably transmits a desired background image resulting from the snapped 
IS picture to the specified mobile telephone embedded in at least one of the 
following: an MMS (Multimedia Messaging Service) message; a WAP (Wireless 
Application Protocol) push massage; and a JAVA™ application. If the background 
image is embedded in a JAVA application, simple editing of the background image 
is supported on the mobile telephone. Specifically, the user can crop an area of the 
20 background image as shown in Fig. 4. 

It is appreciated that the user may keep the background image as 
received from the server* or edit the background image in which case the user may 
go for, as depicted in Fig. 4, a different rendering of the singer (<CROP*)» If the 
user decides to edit the background image, the user may preferably select 'CROP' 
25 to bring forward a proportional focus area that can be moved around the display 
screen by pressing conventional arrow keys. Also, by pressing another set of keys, 
such as and if available, it may be possible to reduce or enlarge the focus 
area as shown in Fig. 5. 

If the background image is accepted as is, no fUrther communication 
30 is required* However, in a case where the background image is edited, another 
transaction is needed. Preferably, editing of the background image returns a set of 
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coordinates to the server, which will then send a new background image as 
specified by the user. 

If the background image Is delivered in an MMS message or a WAP 
push message, editing is not supported on the mobile telephone. Instead, editing 
can be handled on the STB. In such a case, the background image selected by the 
user is returned to the STB via the modem as a bitmap or MPEG still image using 
the very same return path, and the user may edit the bitmap or MPEG still image 
when displayed by the STB on the television screen. When the user terminates 
editing of the background imago, coordinates resulting from the editing are 
preferably communicated from the STB to the server via the return path, and the 
server processes the background image and the coordinates to produce an edited 
background image which is then sent to the mobile telephone. 

The analog snapshot service differs from the snapshot service that 
uses rrv in a way snapshots are ordered, determined and taken. An analog 
snapshot session is typically initiated by die user using a mobile telephone to dial a 
telephone number which is provided by a provider of the analog snapshot service 
and is displayed on the screen of the television when the analog snapshot service is 
made available. A telephone call thus initiated by the user is preferably responded 
by a voice response system. After, typically, an introduction and an explanation of 
the service, and input, by the user, of required information typically including the 
user's mobile telephone number, the voice response system Is ready to accept an 
image selection from the user. The voice response system then preferably prompts 
the user to press a key on a telephone keypad when the user notes a desired image 
on (he screen of the television. 

It is appreciated that the required information may also include 
monetary information or information indicating an agreement by the user to pay 
for the service. In such a case, the monetary Information or the information 
indicating an agreement by the user to pay for the service may he validated by the 
voice response system or by a validation system (not shown) associated with the 
voice response system as part of a monetary transaction as is well known in the art. 
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Typically, the user remains on-line and waits until die desired image 
is displayed on the television screen. The user may then press the key on the 
telephone keypad and the voice response system preferably records a time when 
the key press was made. The time when the key press was made is then matched to 

5 the desired image, and the desired image is preferably transmitted by the server to 
the user's mobile telephone as in the case of the snapshot service that uses ITV. 
The user may men maintain the image or edit the image as m the case of the 
snapshot service that uses ITV. If the user edits the image, the user is typically 
required to initiate a new session to obtain another image that corresponds to a 

10 required editing. 

1.2 Polyphonic Ring Tones 

Preferably, recording, downloading and editing of polyphonic ring 
tones works similarly from a conceptual point of view. Thus, it is possible to use 
almost the same conceptual frameworks as described above for 'snapshots' of 
IS sound for the snapshot service that uses ITV and for the analog snapshot service, 
respectively. 

Referring now back to Fig. 1, the user is preferably able to 'start' 
and 'stop' recording of, for example, at least a part of a song performed by the 
singer live on the entertainment television channel for use as a polyphonic ring 

20 tone on his/her mobile telephone. The recording may be started and stopped by the 
user pressing another dedicated key on the remote control, e.g., the blue key. One 
press on the blue key may be used to start the recording, and another press on the 
blue key may be used to stop recording. An Incrementing bar may preferably be 
displayed while recording, making sure that the user has an easy-to-read indication 

25 of the maximum length of the recording. At the same time, it is a clear indication 
that the recording has started. A timeout period may preferably be determined so 
that if the user does not press the key to stop the recording, the recording will be 
terminated automatically after the timeout period. 

Similar to background images, the polyphonic ring tone may 

30 preferably be transmitted to the mobile telephone embedded in a JAVA 
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application, allowing for simple editing by adjusting start and end points with 
arrow keys of the mobile telephone as shown in Fig. 6. It is appreciated that if the 
mobile telephone has the capability of implementing editing changes to the 
polyphonic ring tone, no ftuther communication is required once the editing is 

5 finished. However, if the mobile telephone does not have the capability of 
implementing the editing changes to the polyphonic ring tone, further 
communication with the server is required to notify the server of the editing 
changes required in which case the server preferably actually implements the 
editing changes and transmits an edited version of the polyphonic ring tone back to 

10 the mobile telephone. 

In the analog snapshot service, the recording may be started and 
stopped by the user pressing different keys on the telephone keypad at respective 
times. 

13 Other Downloads 

15 Depending on mobile telephone capabilities, it may be possible to 

download more than one video frame, such as an entire video clip or commercial* 
In such a case, the user may download video snapshots by using the 'start* and 
'stop' options for starting and ending recording respectively as in the case of ring 
tone recording. In general, the present invention enables the concept of 'An item 

20 that can be 'snapped' can be downloaded' for both the snapshot service that uses 
ITV and the analog snapshot service. 

1.4 Other Media 

The present invention also preferably enables distribution of image 
or sound snapshots to media other than a mobile telephone. For example, the user 
25 may send a snapshot in an e-mail message to Ms/her own e-mail account or to a 
friend. The snapshot may then be used as a desktop image or a start-up sound 
respectively. It is also possible to set-up related websites where users can submit 
their snapshots for others to check out (but not copy), participate in competitions, 
etc. 
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1.5 Branding 

The snapshots offer an opportunity to increase brand loyalty of 
viewers/users because , like pictures in an album users may want to show 
downloaded snapshots to other people. To further increase brand loyalty > any 
5 snapshot delivered by the service may preferably be 'stamped' with a clear 
rendering of a broadcaster's logo, making sure that nobody has any doubts on 
where to go for such a snapshot. 

1.6 Payments for the Service 

As mentioned above, a monetary transaction may be carried out for 
10 payment for die service. Payments may be in die form of micro-payments as is 
well known in the art It is appreciated that for promotion purposes some snapshot 
downloads may be for free whereas others may be downloaded for a price. It is 
further appreciated that snapshots from different television programs may be 
differently priced. 

IS In the analog snapshot service, the user may remain on-line for 

relatively long time periods until die desired image is displayed on the television 
screen. Thus, if the user is also billed for on-line calls to the voice response 
system, such billing may increase the price of a snapshot download; 

2 Owners 9 Rights of Downloaded Snapshots 

20 Broadcasters that are owners of television information that is subject 

to downloading as snapshots may charge for taking snapshots of their own contont. 
For broadcast material whose rights are not owned by the broadcasters, agreements 
may be reached in which either the rights for offering the material within the 
service are acquired by the broadcasters, or revenues from downloading the 

25 snapshots are split between owners of the rights and the broadcasters. Each 
program offered In the service may preferably include a code identifying a rights 
owner that is entitled to at least a portion of payments made by users downloading 
snapshots from the program. 
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It is appreciated that if a program is not offered in the service for 
downloading snapshots, the program may be assigned a metadata code or Hag that 
prevents taking and downloading snapshots thus "turning off* the service tor the 
program. Such "censoring* of the service is typically done by conventional 
5 broadcast scheduling systems (not shown). 

The term "metadata" is used throughout the specification and claims 
to include information descriptive of or otherwise referring to an interactive 
content stream. The information referring to the interactive content stream may 
include, for example, pointers, tags, flags and indexing information. At least part 
10 of the information referring to the interactive content stream may be used to enable 
or disable interactive operations depending, for example, on values assigned to the 
tags or flags. In a case where the interactive content stream is associated with at 
least one television program, the pointers, tags, flags and indexing information 
may point to and index segments of the at least one television program and 
IS portions of the interactive content stream. It is appreciated that the contents of the 
metadata may preferably pertain to either a plurality of users or an individual user. 
The metadata may be used to enable operations on at least one television program, 
if associated with the interactive content stream, and/or on interactive content in 
the interactive content stream. 
20 The metadata may also be useful in billing users for downloading 

snapshots of background images and polyphonic ring tones because the metadata 
identifies a television program from which the snapshots have been downloaded. 
Identification of the program enables the service to credit a rights owner as 
mentioned above. It is appreciated (hat the metadata may also include indications, 
25 for example through tags, of different pricing for snapshots taken from different 
television programs. Preferably, the different pricing is presented to the users 
before submitting requests to download snapshots. 

3 Technology 

A system for downloading television information individually 
30 determined and selected by a user to an external device is described below with 
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reference to Figs. 7A and 7B for a case where the television information comprises 
television information in an MPEG digital format A corresponding system for a 
case where the television information comprises analog television information is 
described below with reference to Figs. 7C and 7D. 

5 3*1 Storing Frames 

Fig. 7A depicts elements at a headend that broadcasts television 
programs in an MPEG format To store video frames* a frame recorder indicated 
"iCapture n is preferably utilised at the headend. Basically, iCapture captures a live 
video stream of a television program, decodes the video stream into frames and 
1 0 stores each of the frames in a frame database that is preferably sufficiently large to 
hold a significant history of frames. Preferably, iCapture attaches to each frame a 
frame identifier that uniquely identifies the frame when retrieving frames. A more 
detailed description of iCapture is provided below in Section 3.7. 

It is appreciated that a required storage capacity of thfe frame 
IS database may be reduced if iCapture is enabled to store MPEG sequences in an 
MPEG format instead of discrete frames. An MPEG sequence is the smallest piece 
of the MPEG stream that can be decoded without any additional information. 
Storage of MPEG sequences is generally more economical than storage of discrete 
frames. Additionally, extraction of images from an MPEG sequence is a fast and 
20 simple operation, and decoding of the MPEG sequence is relatively simple. 

Fig. 7C depicts elements at a headend that broadcasts television 
programs in an analog format. The headend depicted in Fig. 7C also includes an 
iCapture element* but the iCapture of Fig. 7C receives streamed video frames in an 
analog format as input and outputs, for storage in the frame database, discrete 
25 frames in an image format that offers a good quality, such as, for example, the 
PNG format 

3.2 Retrieving Frames 

A conventional application server is preferably responsible for 
retrieving frames* The application server used in the system of Figs. 7A and 7B 
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may be similar in structure and function to the application server used in the 
system of Figs. 7C and 7D. Hie application server is preferably connected to the 
frame database and it processes requests from STBs and mobile telephones and 
generates replies to the mobile telephones. The requests preferably include frame 

5 identifiers and the replies preferably include the corresponding frames. It is 
appreciated that the application server preferably supports and serves requests 
from a plurality of users simultaneously. 

The replies are preferably transmitted through an MMS gateway. 
The MMS gateway may also receive requests for additional frames from the 

10 mobile telephones and transmit them to the application server. 

33 Unique Identification of Frames 

When the user uses the service and a frame is selected the server 
must preferably be notified which frame was selected. Preferably, a unique 
identification (UID) of the frames is constructed in order to enable notification of 
15 selected frames to the saver. 

3.3.1 Identification of Frames in the Snapshot Service that Uses ITV 

3.3.1.1 First Identification Mode 

The UID preferably includes three components: a time code; a frame 
counter; and a discontinuation counter. 
20 The Time Code 

The time code is an integral part of a conventional MPEG video 
stream. The time code is attached approximately every 1/2 second. 

The Frame Counter 

The frame counter specifies how many frames have passed since the 
25 latest change in the time code was detected. This counter Is implemented by 
monitoring the time codes in an STB application. The latest received time code is 
inspected once every 1/25 second. If a new time code is found, it is stored and the 
frame counter is set to zero. If the same time code is received again, the frame 
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counter is incremented. A similar computation of the frame counter is performed 
by i Capture, so mat a time code and a frame counter together identify each frame. 

The frame counter is based on an STB's clock. There is a risk that 
the STB's clock might drift slightly so that a frame cannot be identified with 100 
5 percent accuracy. However, latency in the user's reaction time is expected to be 
much higher than such a drift and therefore the STB's clock drift is typically 
acceptable. 

The Discontinuation Counter 

The combination of time code and frame counter is, however, not 

10 enough to make a complete unique identification of a frame. The problem is that 
the time code may have discontinuities. Discontinuities may be caused by 
changing or rebooting of MPEG encoding equipment at the headend and may also 
occur if pieces of pro-encoded MPEG video are comprised in broadcasted video 
material. The problem with discontinuities is that two different frames may have 

IS the same time code. To distinguish between two different frames with the same 
time code, discontinuities may preferably be counted m both iCapture and the STB 
application. In other words, a discontinuation counter is preferably added to the 
UID. The discontinuation counter is incremented each time the time code in the 
broadcast video material is not continuous. To make it possible for the user to 

20 change channels whenever he/she desires, iCapture preferably broadcasts a current 
discontinuation counter value on a separate data stream encoded, for example and 
without limiting the description, as a document in bDOM which is commercially 
available from NDS Ltd., One London Road, Staines, Middlesex TW18 4EX, 
United Kingdom. 

25 After a period of monitoring the discontinuation counter and the 

time codes being broadcast, the STB application is typically fully synchronised 
with the server and the unique identification is preferably determined. 
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33.1.2 Second Identification Mode 

The second identification mode can be used in cases where the first 
identification mode is not viable due to limitations in headend equipment or STB 
, middleware. 

5 The second identification mode is particularly useful in a case where 

the snapshot service uses 1TV in which it is possible to broadcast information with a 
constant and fixed broadcast latency. STBs useful for such ITV may include any 
appropriate STBs such as, for example and without limiting the generality of the 
invention, STBs of a Media Highway 71 * 4 system which is commercially available 

10 from NDS Limited, One London Road, Staines, Middlesex, TW18 4EX, United 
Kingdom. 

In such a case, a unique number may be broadcast from iCapture to an 
STB, one specific example of which is a Media Highway™ STB. Since the 
broadcast latency Is constant, the STB is able to compute excatly when the number 
IS was sent from ICapture, and it can therefore compute a unique identification based 
on the latest received number and the time that has passed since the latest received 
number was received. Identical computations can also be performed by ICapture, 
and it is therefore possible to compare results of such computations for correct 
identification of a desired background image. 

20 3.3.2 Identification of Frames in the Analog Snapshot Service 

In tite analog snapshot service, a determination of a frame (hat 
includes a desired image is based on an exact determination, as recorded in me voice 
response system, of the time when the user has pressed the key on the telephone 
keypad while viewing a broadcast program. 
25 Preferably, iCapture tags each frame with an indication of the exact 

time the frame was received in a video transmission. In an analog video transmission 
frame tagging is based on an external clock that is used to identify excatly when 
. iCapture has received the frame. Once the frames are tagged, iCapture may operate 
as in the case of the snapshot service that uses ITV to respond to a request for 
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retrieving a frame (hat is provided by the voice response system based on die time 
when the user has pressed the key on the telephone keypad. 

3«4 User Interface 

3,4.1 User Interface in the Snapshot Service that Uses TTV 

5 hi the snapshot service that uses ITV the user interfaces with the STB 

through the remote control and menu screens. The menu screens enable the user to 
select the service, input a target telephone number, and to select an A/V section from 
an A/V program rendered on television by pointing, for example through pressing a 
key on the remote control, at the A/V section substantially when the A/V section is 

10 rendered on the television. The A/V section may, for example, include an image or a 
polyphonic ring tone. 

After selection of a desired A/V section for downloading, the user 
may also use menu screens to approve the selection and a payment, and to allow the 
selection to be transmitted to the headend. The A/V section is identified by a 

IS processor (not shown) at the headend based on when pointing of the A/V section 
was carried out with respect to rendering progress of the A/V program on the 
television. 



3.4.2 User Interface in the Analog Snapshot Service 

20 In the analog snapshot service, the user interface is provided by the 

voice response system. The voice response system may preferably be implemented 
in any appropriate combination of hardware and software. 

Preferably, the user dials a telephone number of the voice response 
system which is provided by a provider of the analog snapshot service. The 

25 telephone number of the voice response system may preferably be displayed on the 
screen of the television when the analog snapshot service is made available. 

The voice response system preferably responds to a telephone call 
initiated by die user by presenting the service and requesting the user to enter user 
input information that is necessary for the service. The user input information may. 
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for example, include the following: the user's mobile telephone number; a source of 
a video signal received by the user; the user's mobile telephone model; an indication 
indicating a mobile operator that provides cellular services to the user's mobile 
telephone; and a target mobile telephone number. 
5 The source of the video signal may typically be a cable source or a 

satellite transmission source* Knowledge of the source is important In order to 
determine latencies as described below. 

In a case where the user's mobile telephone model may be 
autodetected, a requirement to enter the information regarding the mobile telephone 
1 0 model may be optional. 

The target mobile telephone number is required only in a case where 
the user intends to send a snapshot to a mobile telephone otter than his/hers. 
Otherwise* the user's mobile telephone number is used as a target for the snapshot 

It is appreciated that much of the user input information mentioned 
15 above may be provided once and stored at die voice response system for future 
sessions of snapshot downloading. 

After accepting and processing the user input information, die voice 
response system may notify the user that the system is ready to accept a selection of 
a desired image for downloading as a snapshot. The voice response system then 
20 preferably invites the user to press a key on a keypad of the mobile telephone when • 
the user notes a desired image on the screen of the television. 

Preferably, when the voice response system detects that the user has 
pressed the key on the mobile telephone keypad, the voice response system 
generates a request with the exact time the key was pressed Based on such a time 
25 determination, it is possible to locate the frame displayed in the broadcast program 
when the key was pressed. However, there are two requirements that must be met in 
order to enable selection of the correct frame within a negligible range of 
inaccuracy. The first requirement is that clocks of all sub-systems that are Involved 
in the determination of the exact time must be synchronized. Hie second 
30 requirement is that possible latencies in the sub-systems must be determined and 
taken into account The first and the second requirements are discussed below. 
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3*4.2.1 Clock Synchronization and Sub-systems Latencies 

3.4.2. 1 .1 Clock Synchronization 

If the clocks of alt the sub-systems that are involved in the 
determination of the time when the user has pressed the key are not synchronized, 
calculation of the time may be incorrect and the user may, in the end, get an 
incorrect frame. Therefore, a mechanism as is well known in the art which ensures 
that the clocks are synchronised may preferably be employed to synchronize the 
sub-systems clocks* An example of such a mechanism is the Network Tune Protocol 
(NTP) which is described at (he World Wide Web site www.ntp.org, 

3.4.2.1.2 Sub-systems Latencies 

Latencies in die sub-systems that are involved in the determination of 
the time when the user has pressed the key may also cause incorrect calculation of 
the time which in its turn may result in the user being provided an incorrect frame. 
Therefore, it is important to determine the latencies in advance and to take the 
latencies into account when determining die time when the user has pressed die key. 

Most of the sub-systems have latencies that are expected to be fairly 
constant over time, and constant for most users. Any remaining variance in the 
latencies is expected to be small and without significant influence on frame supply 
end results; 

The following sub-systems are expected to contribute latencies: 
telephone connections and the voice response system; video broadcast systems; and 
{Capture. 

Latencies contributed by die telephone connections and die voice 
response system are expected to be insignificant because telephone systems and 
voice response systems are designed a priori to successfully handle voice interaction 
and selections through key presses. 

If the voice response system involves a network connection between a 
hardware that handles telephone calls and a computer that is able to compute die 
time when the user has pressed the key f the network connection may contribute a 

network latency. If the network latency is fairly constant, the network latency may 
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be measured and subtracted* Networks, however, have a tendency to have a high 
variance in latencies if and only if traffic capacity of traffic passing in the networks 
is close to die maximal capacity. Thus, in order to avoid a high variance in latency, a 
network used for providing a snapshot service must have a traffic rapacity larger 
5 than what is normally needed If the network used for providing a snapshot service is 
to be shared, it is important to ensure exclusive reservation of ample capacity of the 
network for the snapshot service. 

Another latency that may be associated with the voice response 
system is related to whether the voice response system records a key press when the 
10 key is pressed down or when the key is released. In such a case, it is preferred, in 
terms of latency, to use a voice response system that records the key press when the 
key Is pressed down. 

Latencies of video broadcast systems depend on how video signals are 
broadcast to users and to iCapture which records the video signals. The video signals 
15 may, for example, be received at iCapture at a different time than at a user 
television. If iCapture receives the video signals from a source other than a source 
that broadcasts the video signals to the user television, it is possible to compute a 
resulting latency, or to find the resulting latency by performing user experiments. 

If users receive the video signals from different sources,, such as from 
20 a cable source and a satellite transmission, significant latencies in video reception 
may occur. In such a case, latencies of cable transmission and satellite transmission 
may be measured in advance and taken into account It is appreciated that the users 
will then preferably be required to input to the voice response system information 
regarding sources of video signals received at their televisions as mentioned above. 
25 Latency in iCapture between a time when the video signals are 

received and a time when a reception time of a desired frame is computed may also 
influence accuracy of the snapshot service. Depending on the structure and operation 
of iCaputre, such latency may be insignificant or measured in advance and token 
into account 
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Fig. 8 shows a simplified block diagram illustration of a system for 
providing a snapshot service in which there are various latencies that may be taken 
into account as described above. 

3,5 Request to Server 

5 When the user snapshots an image the UH> must be transmitted to 

(he server, which is responsible for constructing the background image and 
sending it to the mobile telephone. 

In the snapshot service that uses ITV the UID may be transmitted in 
two different ways. In the first way, the UID may be transmitted in a return-path 
- 10 via the modem of the STB, for example together with the specified telephone 
number for which the background Image or polyphonic ring tone is intended. 
Transmission via Ihe modem does not typically require intervention by the user. 
However, in a case where, for example, the return-path via the modem has low 
penetration or communication latency is high, transmission via the modem may 

IS not be practical and another way to transmit the UID is required. 

In the second way, the UID is preferably displayed on the television 
screen and the user is requested to send a text message, for example an SMS 
message including the UID to the server. The server may then return the correct 
background image. The disadvantage of using the second way is that the user must 

20 construct the text message manually with the risk of errors being introduced as a 
result of human errors. It is appreciated that the two ways may be combined so that 
by default the modem is used, but if an error occurs or the modem has not been 
connected, the user is presented with a message on the television screen requesting 
him/her to send a text message instead 

25 In the analog snapshot service there is no need for the user to 

actually transmit the UID to the server because the voice response system 
transmits the UID to the server. 
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3.6 Mobile Phone Application 

As mentioned above, a JAVA application may be sent along with the 
background image allowing the user to edit the downloaded snapshot. Preferably, 
the JAVA application may support requests for previous or following images to 
5 compensate for the slight risk of the user getting a different frame than what he/she 
really wanted, for example due to mistiming key presses on the remote control or 
the mobile telephone keypad. 

3.7 iCapture 

iCapture is preferably responsible for constructing a series of 
10 pictures based on an input of streaming video. The pictures must correspond to 
each frame in the video, so that it is possible to find a picture for each frame. 
Preferably, iCapture determines, for each frame in the video, whether the frame 
should be accessible to a snapshot service based, for example, on whether the 
frame- belongs to a portion of the video that is subject to an owner's rights and 
1 S there is an agreement with the owner regarding the snapshot service. 

Preferably, iCapture is also responsible for associating a UID to each 
frame, allowing searching and locating a required picture based on the UID. 
iCapture is further responsible for receiving and processing metadata, from a 
broadcaster and dispatching the metadata to other components in the service. 
20 The architecture of ICapture is depicted in Fig. 9A for a case in 

which iCapture deals with MPEG digital video, and in Fig. 9B for a case in which 
iCapture deals with analog video. It is however appreciated mat a combined 
architecture for iCapture (not shown) may alternatively be used for dealing with 
both digital video and analog video. 
25 Referring to Fig. 9A, an input streaming MPEG video is preferably 

inputted to iCapture in a clear form. If the MPEG video was originally encrypted at 
the headend, a decrypter (not shown) at the headend preferably decrypts the 
encrypted MPEG video prior to providing it to iCapture so that iCapture ultimately 
receives decrypted streaming MPEO video. 
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iCapture preferably includes the following elements: an MPEG 
decoder; a UID control unit; a metadata control unit; and a database interface that 
interfaces to the frame database. 

The MPEG decoder is preferably responsible for decoding the 

5 streaming MPEG video and for producing one output with the time code embedded 
in the frames and one with still pictures. The decoder needs to ensure that exactly 
one picture is generated for each frame. The MPEG decoder preferably validates 
the MPEG video so that it is possible to disable the snapshot service for portions of 
the MPEG video for which the snapshot service should not be allowed. The MPEG 

tO decoder also preferably divides the MPEG video Into MPEG sequences for storage 
in the frame database as mentioned above. The MPEG decoder preferably provides 
time codes to the UID control unit. 

The UID control unit Is responsible for synchronising the UID 
computed in iCapture with a UID maintained by the application running on a 

1 5 number of STBs. The UID control unit computes for each frame a time code of the 
frame and thus computes the UID. At the same time, the UID control unit monitors 
the time codes being broadcast to identify any discontinuities in the time codes. 
When a discontinuity occurs, the discontinuation counter is incremented. 

The metadata control unit is preferably operative to receive metadata ... 

20 from outer elements of the headend and to process the metadata. The metadata 
control unit preferably has two responsibilities: 

1. A responsibility for detennining whether the service should be available or 
not Based on such a determination, the metadata control unit sends data to 
the headend (and ultimately to the set-top boxes) which controls the 

25 availability of the service. 

2. A responsibility for constructing a reference between the UID and billing 
information, that is creating a link between the UID and a specific rights 
owner. 

The database interface is preferably operative to insert still pictures 
30 or MPEG sequences and metadata into the frame database. The content of the 
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database is preferably indexed by the UID so that it is possible to perform .lookups 
based on UlDs. 

Referring to Fig. 9B, iCapture that deals with analog video 
preferably includes the following elements: an analog frame decoder; an image 
5 converter; a UID control unit; a metadata control unit; and a database interlace that 
interfaces to the frame database. 

The analog frame decoder is preferably responsible for decoding 
inputted analog video and for preparing frames of the analog video for storage in 
the frame database. 

10 The image converter preferably converts the pictures from a format 

outputted from the analog frame decoder into a format best suitable for storage. 
For example, format conversion may change the size of the pictures to fit a size 
acceptable on mobile telephones. 

The UID control unit preferably determines the UID from frame 
15 timing information provided by the analog frame decoder. 

The metadata control unit and the database Interface may be similar 
to the metadata control unit and the database Interface in {Capture that deals with 
digital video. 

Since, in the snapshot service that uses ITV, the U£D is based on the. 
20 time code embedded in the MPEG video, some other way is required to compute 
the UID if the input for iCapture is a digital signal in a format other than MPEG. In 
such a case, some other type of timing information must be provided that can be 
used as a basis for computing the UID. Additionally, the MPEG decoder needs to 
be replaced by another appropriate device that is capable of decoding the format 
25 other than MPEG and generating still pictures from such a format. 

It is appreciated (hat various features of the invention which are, for 
clarity, described in the contexts of separate embodiments may also be provided In 
combination in a single embodiment. Conversely, various features of the invention 
which arc, for brevity, described in the context of a single embodiment may also be 
30 provided separately or in any suitable subcombination. 
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It will be appreciated by persons skilled In the art that the present 
invention is not limited by what has been particularly shown and described 
hereinabove. Rather the scope of the invention Is defined by the claims which 
follow: 
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What is claimed is: 

CLAIMS 

1. . A method for selecting an A/V section from an A/V program 
5 rendered on a television, the method comprising: 

pointing at the A/V section substantially when the A/V section is 
rendered on the television; and 

identifying said A/V section based on when said pointing was 
carried out with respect to rendering progress of the A/V program oh the 
10 television. 

2. Apparatus for selecting an A/V section from an A/V program 
rendered on a television, the apparatus comprising: 

a pointer enabling pointing at the A/V section substantially when the 
15 A/V section is rendered on the television; and 

a processor identifying said A/V section based on when pointing of 
the A/V section was carried out with respect to rendering progress of the A/V 
program on the television. 



20 3. Apparatus substantially as described hereinabove. 

4. Apparatus substantially as shown in the drawings. 

5. A method substantially as described hereinabove. 

25 

6. A method substantially as shown in the drawings. 

7. A system substantially as described hereinabove. 
30 8, A system substantially as shown in the drawings* 
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